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Abstract: This paper deals with the study of dependencies between two 
given events modeled by point processes. In particular, we focus on the 
context of DNA to detect favored or avoided distances between two given 
motifs along a genome suggesting possible interactions at a molecular level. 
For this, we naturally introduce a so-called reproduction function h that 
allows to quantify the favored positions of the motifs and which is considered 

"^ys \ as the intensity of a Poisson process. Our first interest is the estimation of this 

function h assumed to be well localized. The estimator h based on random 
thresholds achieves an oracle inequality. Then, minimax properties of h on 
Besov balls ;B| oo(-^) ^'^^ established. Some simulations are provided, allowing 
the calibration of tuning parameters from a numerical point of view and 
proving the good practical behavior of our procedure. Finally, our method is 
Kj^ . applied to the analysis of the influence between gene occurrences along the 

0^ \ E. coli genome and occurrences of a motif known to be part of the major 

^^ ■ promoter sites for this bacterium. 

'^ \ Keywords: Adaptive estimation, interactions model, oracle inequalities, 

t^^ '_ Poisson process, thresholding rule, [/-statistics, wavelets. 
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1 Introduction 



5h \ The goal of the present paper is to study the dependence between two given events modeled by point 
processes. We propose a general statistical approach to analyze any type of interaction, for instance, 
interactions between neurons in neurosciences or the comprehension of bankruptcies by contagion in 
economics. In particular, we focus on a model to study favored or avoided distances between patterns 
on a strand of DNA, which is an important task in genomics. 

We are first interested in the modeling of the influence between two given motifs, a motif being de- 
fined as a sequence of letters in the alphabet {a,c,g,t}. This alphabet represents the four nucleotides 
bases of DNA: adenine, cytosine, guanine and thymine. Our aim is to model the dependence between 
motifs in order to identify favored or avoided distances between them, suggesting possible interactions 
at a molecular level. Because genomes are long (some 1 million bases) and motifs of interest are short 
(3 up to 20 bases), motif occurrences can be viewed as points along genomes. For convenience, we 
work in a continuous framework and then, the occurrences of a motif along a genome are modeled by a 
point process lying in the interval [0;T], where T is the normalized length of the studied genome and 
will drive the asymptotic. We add that our model focuses on only one direction of interactions, that 
is to say we investigate the way a first given motif influences a second one. To study the influence of 
the second motif on the first one, we just invert their roles in the model. 



Laure Sansonnet 



We observe the occurrences of both given motifs (we presuppose interactions between them) and 
we assume that their distributions are as follows. The locations of the first motif are modeled by 
a n-sample of uniform random variables on [0;T], denoted Ui, . . . ,Un and named parents. As the 
parameter T, the number n of parents will also drive the asymptotic. Then, each Ui gives birth 
independently to a Poisson process N^ with intensity the function t i — > h{t — Ui) with respect to the 
Lebesgue measure on M (for instance, see |17|). which models the locations of the second motif. We 
consequently observe the aggregated process 

n n 

N = ^N' with intensity the function 1 1 — >^h{t- U) (1.1) 

and the points of the process N are named children. But in this model, for any child we do not observe 
which parent gives birth to him. The unknown function h is so-called reproduction function. Our goal 
is then to estimate h with the observations of the Ui's and realizations of N. 

Such a modeling of locations of the first motif is linked to the work on the distribution of words 
in DNA sequences of Schbath and coauthors (for instance, see |31| . [24] and [29j). Indeed, the first 
motif of interest is a rare word and is modeled by a homogeneous Poisson process N^ on [0; T]. Thus, 
conditionally to the event "the number of points falling into [0;T] is n", the points of the process N^ 
(i.e. the parents) obey the same law as a n-sample of uniform random variables on [0;T]. Moreover, 
with very high probability, n is proportional to T and this constitutes the asymptotic considered in 
genomics, to which we will refer as the "DNA case". With our model (considering a uniform law on 
the parents), we can also take into consideration the cases n <C T (parents are far away with respect 
to each other and one can almost identify which points are the children of a given parent) and n^T 
(parents are too close to each other, which leads to hard statistical problems). 

If n = 1, the purpose is to estimate the intensity of only one Poisson process. Many adaptive 
methods have been proposed to deal with Poisson intensity estimation. For instance, Rudemo |30) 
studied data-driven histogram and kernel estimates based on the cross-validation method. Donoho 
[8j fitted the universal thresholding procedure proposed by Donoho and Johnstone [9j by using the 
Anscombe's transform. Kolaczyk |18) refined this idea by investigating the tails of the distribution of 
the noisy wavelet coefficients of the intensity. By using model selection, other optimal estimators have 
been proposed by Reynaud-Bouret [25] or Willett and Nowak [32j . Reynaud-Bouret and Rivoirard [26j 
proposed a data-driven thresholding procedure that is near optimal under oracle and minimax points 
of view, with as few support assumptions as possible (the support of the intensity h may be unknown 
or not finite), unlike previous methods that need to assume that the intensity has a known bounded 
support. 

We notice that the reproduction function h can be also viewed as the intensity of a Cox process 
(for instance, see [5]) where the covariates are the parents Ui, . . . ,Un- Comte et al. [3] proposed an 
original estimator of the conditional intensity of a Cox process (more generally, a marker-dependent 
counting process). Using model selection methods, they prove that their estimator satisfies an oracle 
inequality and has minimax properties. Note that we consider here point processes on the real line. 
Some aspects of similar spatial processes are studied in a parametric way |23j . for instance. 

Some work has been done to study the statistical dependence between motif occurrences. For 
instance, in Gusto and Schbath's article [12], the framework consists in modeling the occurrences of 
two motifs by a Hawkes process (see [13|): our framework can be viewed when the support of h is in 
]R_l_ as a very particular case of theirs. Their method, called FADO, uses maximum likelihood estimates 
of the coefficients of /i on a Spline basis coupled with an AIC criterion. However, even if the FADO 
procedure is quite effective and can manage interactions between two types of events, spontaneous 
apparition (a child can be an orphan) and self-excitation (a child can give birth to another child), 
there are several drawbacks. In fact, this procedure is a parametric estimation method coupled with a 
classical AIC criterion which behaves poorly for complex families of models. Moreover, FADO involves 
sparsity issues. Indeed, our feeling is that if interaction exists, say around the distance d bases, the 
function h to estimate should take large values around d and if there is no biological reason for any 
other interaction, then h should be null anywhere else. However, if the FADO estimate takes small 
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values in this last situation, it does not vanish, which can result in misleading biological interpretations 
(see |27|). Finally, in this model, the occurrences of the first motif only depend on the past occurrences 
of first and second motifs. Reynaud-Bouret and Schbath |27) have proposed an alternative based on 
model selection principle for Hawkes processes that solves the sparsity problem. Their estimate satisfies 
an oracle inequality and has adaptive minimax properties with respect to certain classes of functions. 
But it manages only one motif whereas it is necessary to treat interaction with another type of events 
and the method has a high computational cost even for a small number of models. Note that Hawkes 
processes have a long story of parametric inference (see [22], [21] and [^). In particular, for genomic 
data, Carstensen et al. |2] recently deal with multivariate Hawkes process models in order to model 
the joint occurrences of multiple transcriptional regulatory elements (TREs) along the genome that 
are capable of providing new insights into dependencies among elements involved in transcriptional 
regulation. 

In this paper, the proposed model is simple. Each child comes from one parent (no orphan and 
no child who is a parent), that is to say we do not take into account the phenomenons of sponta- 
neous apparition and self-excitation, contrary to Hawkes process models. But it brings novelties. To 
estimate the reproduction function h, we propose a nonparametric method, using a wavelet thresh- 
olding rule that will compensate sparsity issues of the FADO method. Furthermore, our model treats 
interaction between two types of events, with a possible influence of the past occurrences but also 
future occurrences. Then, there is the presence of a double asymptotic: the normalized length of the 
studied genome T and the number n of parents, which is not usual. In the biological context, it is not 
acceptable assuming to know each child's parent. Our model, via the reproduction function h, allows 
to quantify the favored locations of children in relation to their parent, even if one cannot attribute a 
child to a parent before the statistical inference. First we provide in this paper theoretical results and 
we derive oracle inequalities and minimax rates showing that our method achieves good theoretical 
performances. The proofs of these results are essentially based on concentration inequalities and on 
exponential and moment inequalities for [/-statistics (see [6], [11] and [14j). Secondly some simulations 
are carried out to validate our procedure and an application on real data {Escherichia coli genome) 
is proposed. The procedure provides satisfying reconstructions, overcomes the problems raised by 
the FADO method and agrees with the knowledge of the considered biological mechanism. For these 
numerical aspects, we have used a low computational complexity cascade algorithm. 

In Section 2, we define the notations and we describe the method. Then Section 2 discusses 
the properties of our procedure for the oracle and minimax approaches. Section 3 is devoted to the 
implementation of our method and provides simulations. The cascade algorithm is presented in Section 
3.1. Section 4 presents the application on the complete Escherichia coli genome. A more technical 
result that is at the origin of the one stated in Section 2.3 and proofs can be found in Section 6 
(Appendix) . 

2 General results 

2.1 Notations 

To estimate the reproduction function, we assume that h belongs to i(M) and oo(IK)- Consequently, 
we can consider the decomposition of /i on a particular biorthogonal wavelet basis, built by Cohen et 
al. [3], that we can describe as follows. We set (/> = lfo,il the analysis father wavelet. For any r > 0, 
there exist three functions ■(/;, (p and tp with the following properties: 

• (p and tp are compactly supported, 

• (p and ip belong to C^~^^, where C^~^^ denotes the Holder space of order r -\- 1, 

• ip is compactly supported and is a piecewise constant function, 

• ^ is orthogonal to polynomials of degree no larger than r. 
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|((/>fe,'0i,fc)j>o,fcez, {4'k,tpj,k)j^o,k&j is a biorthogonal family: for any j,j' ^ 0, for any k,k' G 

(t>k{x)ipj',k'{x) dx = ipj^k{x)4^k'{x) dx = 0, 



<t}k{x)(lik'{x)dx = l{k=k'}: j iJj,k{x)iJj',k'{x)dx = lsj=j,^k=k'}, 
where for any x G M, 

Mx) = Hx - k), ijj,kix) = 2^'/2^(2ix - k) 

and 

Mx) = kx - k), iPj,k{x) = 2^'/2^(2Jx - k). 

On the one hand, decomposition wavelets (pf^ and 'i/'j ^ are piecewise constant functions and, on the 
other hand, reconstruction wavelets (pk and tpj^k are smooth functions. This implies the following 
wavelet decomposition of h £ 2 (IK): 

h = ^ak4>k + ^^Pj,k'ijJj,k, (2.1) 

fcez j^o fcez 

where for any j ^ and any k £ Z, 

Oik = h{x)(t>k{x) dx, Pj,k= I h{x)'ipj^k{x) dx. 

The Haar basis, used in practice, can be viewed as a particular biorthogonal wavelet basis, by setting 
4> = (p and tp = ip = lii.]^i — Ifo-ii) with r = (even if the second property is not satisfied with such 
a choice). The Haar basis is an orthonormal basis, which is not true for general biorthogonal wavelet 
bases. This kind of decomposition has already been used in thresholding methods by Juditsky and 
Lambert-Lacroix |16| . Reynaud-Bouret and Rivoirard [26) . and Reynaud-Bouret et al. [28j. 
To shorten mathematical expressions, we set 

A = {X={j,k):j;?-l,k€Z}, 

and for any A G A, 

^r 0fc ifA = (-l,A;) ~ =[ k ifA = (-l,fc) 

'^^ \ ^j- fc if A = (j, k) with i ^ ' '^^ \ V^^- fc if A = (i, k) with j ^ 

and similarly 

at ii\ = {-l,k) 



^^ 1 /3j, fc if A = (j, k) with i ^ 



Then (j2.ip can be rewritten as 

/i = ^ /3a (^A with /3a = /" /i(x)(^A(x) dx (2.2) 



AgA 



and now, we have to estimate these wavelet coefficients. 
For all A in A, we define /3a an estimator of /3a as 



/3a = ^^, with GM= fJ2 



71 — 1 

^x{t-Ui) E^((^A(t-f/)) 

n 



dNt, (2.3) 



where tt is the uniform distribution on [0;T] and KT^{{p\{t — U)) denotes the expectation of ipx{t — U) 
where [/ ~ vr (an independent copy of C/i, . . . , [/„)• If n = 1, we obtain the natural estimators of the 
P\s in the case of only one Poisson process on the real line (see [26] ). 



Nonparametric estimation in a Poissonian interactions model 

Lemma 2.1. For all A = {j, k) in A, 

E(G(<^a)) = "■ / <fx{x)h{x)dx, 

Jm. 

i.e. f3\ is an unbiased estimator for (3\. Furthermore, its variance is upper hounded as follows: 

Var(/3.)^c{i + l + ^ 



and 



where C and C depend on 



(I n 

supVar(/3A)^C'<^- + -2 
AeA I n J ^ 



and 



The behavior of the variance of the /3a 's is not usual, because two parameters n and T are involved. 
Nevertheless, when n is proportional to T ("DNA case" as explained in Introduction), the variance is 
bounded by 1/T up to a constant, as for the Hawkes process (see [27]). When n <^T, the variance 
is bounded by 1/n up to a constant, which means that the apparition's distance between two parents 
is large enough to make their interactions insignificant for the statistical analysis. So in this case, 
our framework can be viewed as the observation of a n-sample of a Poisson process with common 
intensity h (see |26j). Finally, when n ^ T, the variance deteriorates and is only bounded by n/T'^ 
up to a constant, and in this case, the small apparition's distance between two parents leads to rough 
statistical issues hard to overcome. 

2.2 Description of our method 

We start assuming that h is compactly supported in [— ^;^], with A a positive real number. This 
quantity A can denote the maximal memory along DNA sequences (this is chosen by the biologists (see 
|12|). depending on the underlying biological process they have in mind). Furthermore, the properties 
of the biorthogonal wavelet bases introduced previously allow us to assume that we know a positive 
real number M such that the support of ip is contained in [— M; M]. 
First, we introduce the following deterministic subset P of A 

r = {X = {j,k)eA:-l^j^ jo, kelCj}, 

where jo a positive integer that will be fixed later and at each resolution level j, we denote ICj the 
set of integers such that the intersection of the supports of ip\ and h is not empty, with A = {j,k). 
Straightforward computations lead to a cardinal of P of order 2-"' . 

Then, given some parameter 7 > 0, we define for any A G P, the threshold 



,.(7,A) = ,/2™F(^) + fB(^)+Ai!S 



\ n 



n 



(2.4) 



where A is a positive quantity and N^ is the number of points of the aggregated process A^ lying in 
M. For theoretical results, A will be taken of order -'^ h -^ + ^y" times a constant depending on 

7, llV'lli, 



2 and llV'lloo- In (|2.4p . we set 



B 



"Px 



n 



-B{ipx) = - 
n n 



E 



Ti — 1 

Vx{--Ui) E^{ipx{--U)) 



n 



(2.5) 



and 



V 



fx\ 



n / n- 



:y{^x) 



n^ 



Vi^x) + J2jJoViipx)B^iipx) + 3^JoB\ipx, 



(2.6) 
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where 



y{v> 




Vxit - U^) 



n 



1 



n 



E^iifxit-U)) 



dNt. 



(2.7) 



Since they only depend on the observations, the numerical values of B{ipx), Vifx) and so V{lpx) 
defined respectively by ()2.5p . (|2.7p and ()2.6p can be exactly computed. 

We denote /3 the estimator of /3 = (/3A)AeA associated with the previous thresholding rule: 



^- (/^Al|/3A|^r,,(7,A)lAer 



AeA 



(2.8) 



(2.9) 



and finally, we set 

AeA 

an estimator of h that only depends on the choice of (7, A) and jo fixed later. 

Thresholding procedures have been introduced by Donoho and Johnstone [9]. They derive from the 
sufficiency to keep a small amount of the coefficients to have a good estimation of the function h. The 



threshold rjx{'y, A) seems to be defined in a rather complicated manner but the first term: \/2jJqV (^) 
looks like the universal threshold proposed by [9] in the Gaussian regression framework, by choosing 
7 close to 1 and Jq of order logn. The universal threshold of [9] is defined by rjx = y2o"^logn, 
where a'^ (assumed to be known) is the variance of each noisy wavelet coefficient. In our setting, 
Var(/3A) depends on h, so it is (over) estimated by V (^)- The other terms of the threshold ()2.4p are 
unavoidable remaining terms which allow to obtain sharp concentration inequalities. 



2.3 Main result and discussions 

Our main result is an oracle one. Given a collection of procedures (for example, penalization, projection 
or thresholding), the oracle represents the ideal "estimator" among the collection. In our setting the 
oracle gives, for our thresholding rule, the coefficients that have to be kept. In our framework (see [S] 
and |26) ) . the "oracle estimator" is 



h = ^^x^x, with ^a = /3a1 



Var{/3;,)</32 



Aer 



This "estimator" is not a true estimator, of course, since it depends on h. The approach of optimal 
adaptation is to derive true estimators which achieve the same performance as the "oracle estimator". 
Our goal is now to compare the risk of h defined in Section 2.2 to the oracle risk: 



E(||^-/i||i)=E^E[(/5Alvar(/3.)</3j 



/3a)'] + E /^a = E niin(Var(/3A), /3l) + E A 



Aer 



A^r 



Aer 



A^r 



Theorem 1. We assume that n ^ 2, jo G N* such that 2-'" < n < 2-'"+-'^, 7 > 21og2 and A is defined 
in the Appendix by \6.15\) and i6.16\) . Then the estimator h defined in Section 2.2 satisfies 



E{\\h-h\\l)^CiMA E/3I + 



\4m 



{\ognf X - + {\ognf X —r 
n i ^ 



\m\ 



+ C2 



1 n 

n r^ 



where |m,| is the cardinal of the set m, Ci is a positive constant depending on 7, ||/i||i 
||'0||2 and 1 1 V' 1 1 00 o-nd C2 is a positive constant depending on the compact support of h, ||/i||i 
compact support ofip, \\il^\\i, \\ip\\2 o-nd HV'lloo- 



00, IIV^IIl; 

, the 



As the expression between brackets is of the same order as the upper bound of Yar(f3x) established 
in ILemma 2.11 (up to a logarithmic term), the oracle type inequality of [Theorem II proves that the 
estimator h achieves satisfying theoretical properties. 
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In particular, if we apply [Theorem IJ with n proportional to T ("DNA case"), then the estimator 
h defined in Section 2.2 satisfies 



K{\\h-h\\l)^C,MAT.^l + 



(logT) 




mcr , 

This oracle type inequality is similar to the one obtained by Theorem 1 of |27) where the Hawkes process 
is considered. Since n is proportional to T, this inequality is typical of classical oracle inequalities 
obtained in model selection (for example, see Theorem 2.1 of [26] where only one Poisson process on 
the real line is considered or more generally, see |20| for density estimation). 

Then, we establish a minimax result on Besov balls still with n is proportional to T. For any 
R > and s G M such that < s < r + 1 (where r > denotes the wavelet smoothness parameter 
introduced in the description of the biorthogonal wavelet bases at the beginning of the current section) , 
we consider the following Besov ball of radius R: 



BlooiR) 



f G 2(K) ■.f = Yl /3a<^a, Vi ^ -1, Yl f^lk) < ^'2-''' 



> . 



AgA k^Kj 

Now, let us state the upper bound of the risk of h when h belongs to B2 ooi^)- 

Corollary 2.1. Let R > and s G M such that < s < r + 1. Assume that h G B2oo{R) ^^d n is 
proportional to T. Then the estimator h defined in Section 2.2 satisfies 



e(ii/..-/.iiO«c(<!^)"" 



where C is a positive constant depending on 7, the compact support of h, \\h\\i, \\h\\oo, the compact 
support of tjj , IIV'lli, llV'lb, IIV'lloo and R. 

The rate of the risk of h corresponds to the minimax rate, up to the logarithmic term, for estimation 
of a compactly supported intensity of a Poisson process (see |25| ) or for a compactly supported density 
when we have n i.i.d. observations (see [lOj). One more time this illustrates the optimality of the 
procedure h but in the minimax setting. 

3 Implementation procedure 

From now on we consider the context of DNA, i.e. n is proportional to T. As mentioned in Introduction, 
we can assume that the parents are the points of a homogeneous Poisson process N^ on [0; T] with 
constant intensity // which allows to write n ~ /xT. 

In this section, we specify a procedure for the computation of the family of random thresholds 
{rjx{'j,A))xi^Y' to reconstruct the reproduction function h. We also provide some simulations in order 
to calibrate parameters from a numerical point of view and to show the robustness of our procedure. 

3.1 Algorithm 

We only focus on the Haar basis where 

</> = (A = l[0;i] and V = ■0 = l]i;i] -l[o;i]> 

because the expression of the functions associated to this basis, that are piecewise constant functions, 
allows to implement simple and fast algorithms. Furthermore, considering this kind of functions is 
suitable for our genomic setting. In fact, according to biological studies, the reproduction function h 
is expected to be very irregular, with large null ranges and sudden changes at specific distances. We 
recall that h is assumed to be compactly supported in [—A; A], with A a positive integer in practice. 
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We consider the thresholding rule h defined in Section 2.2 with 

T = {X = {j,k)eA:-l^j^ jo, kelCj}, 
and 



\ n J 3 \ n J \/t n 

Observe that 7]x('j,5) slightly differs from the threshold defined in (|2.4p since the parameter A is 
replaced with -j= (thanks to the definition (|6.15p of A) and V{(p\) is now replaced with V{(px) (there 
is no major difference in our simulations). The ideal choice (from a theoretical point of view) of the 
maximal resolution level Jq is given by [Theorem 1\ that is to say Jq is the positive integer such that 
2^° ^ n < 2^°~^^. But we will fix jo = 5 in the sequel (in particular, to limit the computation time). 
The choice of the parameters 7 and 5 is discussed in the next subsection. 
A key point of the algorithm is the computation of the quantity 



s{^xm = Yl 



i=l 



Tl — 1 

Mt - Ui) ^Avx{t - u)) 

n 



for all t £ 



that appears in (3x, B{ipx) ^-nd V{ipx)- We decompose it into two parts: a random "piecewise constant" 

n 

part Sr{y^x) = / y^\{' ~ Ui) and a deterministic (piecewise affine) part (n — l)¥,T^{ipx{t — U)). Note 



i=l 



that the deterministic part can easily be implemented with a low computational cost. This is not 
the case of the random "piecewise constant" part for which we have constructed a cascade algorithm, 
inspired by the pioneering work of Mallat |19) . To explain this algorithm in few words, we use the 
following notations: for any j ^ 0, for any k € Z, for any j; S M, 

0,,fc(x) = 2^/^(t>i2^x - k), Vi,fc(a;) = 2JV2^(2i^ _ fc), 

where (pj^i^ are father wavelets and ipj^k mother wavelets. We have the following relationships between 
wavelets at level j and wavelets at level (j + 1): 

\/2 \f2 

'^j,k = -^{<Pj+i,2k+i - 4>j+i,2k) and (j)j^k = -^(<;/'i+i,2fc + <Pj+i,2k+i)- (3.1) 

We notice that only mother wavelets and the father wavelet of level j = (that corresponds to ipx with 
A = (— 1,A;)) are used to reconstruct the signal. The cascade algorithm is implemented as follows. 

1. Compute Sr{4>j^fl)- Since Sr{4>JQfi) is a piecewise constant function, this computation gives a 
partition and the values of Sr{4>JQfi) on the intervals of the partition. 

2. Shift by +2~^'^k the intervals of the previous partition by keeping the same values on the partition 
to obtain Sr{4>JQ,k) for any integer k in [—2^" A; 2^° A — 1]. 

3. For any resolution level j going from (jo — 1) to 0, in a decreasing way, compute Sr{tpj,k) and 
Sr{(t>j.k) with expressions (j3.ip . The quantities Sr{ipj.k) allow the reconstruction of the signal 
and the quantities Sr{(pj^k) are transitional and will be used for the computations of the lower 
resolution level (j — 1). 

4. Also keep Sr{4>Q,k) because it is used for the reconstruction of the signal. 
Now, let us define our thresholding estimate of h for a practical purpose. 

Step Let jo = 5 and choose also positive constants 7 and 5. 

Step 1 Set r = |A = (j. A;) G A : — 1 ^ j ^ jo. A; G /Cj} and compute for any A in F, S{lpx){X) for all 
points X of the process N . In the same way, also compute the coefficients /3a, B{ipx) and V{(px)- 
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Step 2 Threshold the coefficients by setting f3x = (3x1 
choice: 



^a( 



<-^)-\/^™n^) -?-(?)+ 



lo 1^ r s\ accordine; to tlie following threshold 



3 \ n J \/t n 



Step 3 Reconstruct the function h by using the f3x^s and denote 

h = '^^x'Px- 
agA 

3.2 Experiments on simulated data 

The programs have been coded in Scilab 5.2 and are available upon request. 

3.2.1 Choice of parameters 

Now, we deal with the choice of the parameters 7 and 6 in our procedure from a practical point of 
view. The question is: how to choose the optimal parameters? We work with two testing functions 
denoted 'Signall' and 'Signal2' whose definitions are given in the following table: 




with v, the children's intensity, set to 4. We fix willfully A = 10. Such a choice of A (remember that 
[—A; A] is the support of h) assumes that we do not know the support of functions. We recall that 
Jo = 5. 

Given T, fj, the parents' intensity and a testing function, we denote R{'y, 6) the quadratic risk of 
our procedure h (depending on (7,5)) defined in Section 3.1. Of course, we aim at finding values of 
(7, S) such that this quadratic risk is minimal. The average over 100 simulations of R{'y, 5) is computed 
providing an estimation of E{R{-y,5)). This average risk, denoted R{'y,5) and viewed as a function 
of the parameters (7,(5), is plotted for (T,//) G {(10000, 0.1), (2000,0.1), (2000, 0.5)} and for the two 
signals considered previously: 'Signall' and 'Signal2'. 




Figure 1: The function (7,(5) H' i?(7,5) for 'Signall' and 'Signal2' for different values of T and fj,: 'Signall' 
in m and 'Signal2' in ■ with {T,fi) = (10000,0.1); 'Signall' in ■ and 'Signal2' in ■ with {T,fi) = (2000,0.1); 
'Signall' in ■ and 'Signal2' in with {T,fj,) = (2000,0.5). 
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Figure 1 displays R for 'Signall' and 'Signal2' decomposed on the Haar basis. This figure allows to 
draw the following conclusion: for any (r,/x) G {(10000,0.1), (2000,0.1), (2000,0.5)} and for 'Signall' 
or 'Signal2', 

i?(7,5)«0 
for many values of (7,5). So, we observe a kind of "plateau phenomenon". 



Reconstructions of the intensities of 'Signall' and 'Signal2' are respectively given in Figure 2 and 



Figure 3 with the choice (7,(5) = (0.18,2.4), a common value of several plateaus. Note the good 
performance of our thresholding rule, in particular for T = 10000 and // = 0.1 (we have ^T = 1000 
parents and ^vT = 4000 children in average), which corresponds to the real case treated in Section 4. 
Thus, we propose to take systematically (7,(5) = (0.18,2.4) in our procedure h defined in Section 3.1. 
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Figure 2: Reconstructions of 'Signall^ (true: dotted line, estimate: solid line): left: (T^fi) = (10000,0.1); 
middle: (T.fi) - (2000,0.1); right: {T,ij) = (2000,0.5). 



i—i—i— I— I— i—i— I— I— i—i— I— I— I— 1 
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Figure 3: Reconstructions of 'Signal2' (true: dotted line, estimate: solid line): left: {T,ii) = (10000,0.1); 
middle: {T,fi) = (2000,0.1); right: (T, /i) = (2000,0.5). 
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3.2.2 About the support of h 

We are interested in the robustness of our procedure with respect to the support issue from a numerical 
point of view. What happens if we are wrong about the support of the function that we want to 
estimate? For instance, we consider the testing function denoted 'Signals' whose definition is given in 
the following table: 




with z^, the children's intensity, set to 4. 

[Figure 4 displays reconstructions of 'Signals' with different supports of h: [—A; A], with A G 
{1,5, 10}. This figure shows that when we take a not large enough support (A = 1 or 5), we do not 
make large errors of approximation on [— ^;^]. So, the procedure seems to take into account what 
happens beyond the chosen support. And for A = 10, we have a good complete reconstruction of 
'Signals'. 



i.cr 

0.5- 




n 




-n..'T 



















0.5- 





































0.5- 





































Figure 4: Reconstructions of 'Signals ' (true: dotted line, estimate: solid line) with different supports: top: 
A= 1: middle: A — 5; bottom: A = 10. 



Finally, even if the support of the reproduction function is unknown, our method estimates correctly 
the signal on the chosen support, which explains the robustness of our procedure with respect to the 
support issue. 



3.2.3 The case of spontaneous apparition 

Here, we investigate the case of spontaneous apparition. Even if our model does not take into account 
the spontaneous apparition (i.e. children can not be orphans), we are interested by the performance of 
our procedure if there is a presence of orphans. On the one hand, let us give two processes: a process 
of intensity 'Signall' with u = 3, T = 10000 and /i = 0.1, to which is added a homogeneous Poisson 
process on [0;T + 1] with intensity /i(4 — z/) = 0.1 (the orphans are viewed as a Poissonian noise). 
Thus, we have in average 1000 parents, 3000 children having a parent and 1000 children being orphans. 
On the other hand, let us give two other processes: a process of intensity 'Signall' with v = 1 this 
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time, T = 10000 and fi = 0.1, to which is added a homogeneous Poisson process on [0; T + 1] with 
intensity /u(4 — z^) = 0.3. Thus, we have in average 1000 parents, 1000 children having a parent and 
3000 children being orphans. 

Reconstructions of 'Signall' with z^ = 3 and 1^ = 1 are given in [Figure 5[ When there is a small 
proportion of children being orphans, the reconstruction is still acceptable; the procedure can manage 
few orphans. But, when there are too many orphans, our procedure makes approximation errors, which 
are due to the fact that our model consists in associating any child with a parent. 
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-1.0 -0.5 0.0 0.5 1.0 1.5 2. 



Figure 5: Reconstructions of 'Signall' (true: dotted line, estimate: solid line) with different values of v: left: 
V = 3; right: v = \. 



We mention that the case of spontaneous apparition is only numerical. For a more precise study of 
this phenomenon, we should extend our model by adding a positive constant to the intensity function 
1 1 — > Z]r=i ^(* ~ ^«)' that would represent the orphans. This is outside the scope of this paper. 



4 Applications to genomic data 

As application, we are interested in the Escherichia coli genome. E. coli is an intestinal bacterium in 
mammals and very common in humans which is widely studied and used in genetics. More precisely, 
we are interested in the study of the dependence between promoter sites and genes along the complete 
genome of the bacterium. In particular, promoters are usually structured motifs located before the 
genes and not too far from them. Here, we have considered the major promoter of the bacterium E. coli 
and more precisely the word tataat. Most of the genes of E. coli should be preceded by this word 
at a very short distance apart. In order to validate our thresholding estimation procedure (proposed 
at Section 3), we hope to detect short favored distances between genes and previous occurrences of 
tataat. 

For this, as in |12] we have analyzed the sequence composed of both strands of E. coli genome 
(4639221 bases); each strand being separated by 10000 artificial bases to avoid artificial dependencies 
between occurrences on one strand and occurrences on the other strand; we took 10000 bases for the 
maximal memory. It then represents a sequence of length 9288442; there are 4290 genes (we took the 
positions of the first base of coding sequences) and 1036 occurrences of tataat. For convenience, we 
set T = 9289 and so A = 10 (we work on a scale of 1 : 1000). We recall that we have fixed jo = 5 and 
taken (7,5) = (0.18,2.4). 
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First, we investigate the way the DNA motif tataat influences genes and so, in our model, the 
parents are the occurrences of tataat and children are the occurrences of genes. To give general insight 



on h, Figure 6 gives the estimator h defined in Section 3.1 without Step 2 (no thresholding), i.e. we 
have kept all the estimated coefficients. We observe a peak around which corresponds to what we 
thought about the fact that most of the genes of E. coli should be preceded by the word tataat at a 
very short distance apart. We also observe other peaks, for instance around 1200 bases. The biological 
significance of these peaks remains an open question. 



-9 o 




Figure 6: Estimator, no thresholding, for E. coli data at the scale 1 : 1000 (i.e. 1 corresponds to 1000 bases), 
with parents = tataat and children = genes. 



We apply the complete procedure proposed in Section 3.1 (with thresholding) and we obtain 



Figure 7 The shape of this estimator explains how occurrences of genes are infiuenced by occurrences 
of tataat. We can draw following conclusions, that coincide with the ones we could expect: 

• The estimator h[t) = if t ^ and t ^ 500. It means that for such t's, gene occurrences seem 
to be uncorrelated of tataat occurrences. 

• Conversely, if t G [0; 500], h{t) > 0, meaning that short distances are favored; smaller the distance, 
higher is the infiuence. 




Figure 7: Estimator h defined in Section 3.1 for E. coli data at the scale 1 : 1000, with parents = tataat and 
children= genes. 



Then, we investigate the way genes infiuences the DNA motif tataat and so, in our model, the 



parents are the occurrences of genes and children are the occurrences of tataat. Figure 8 gives the 
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estimator h defined in Section 3.1 (with (7,(5) = (0.72,2.4)). The shape of this estimator explains how 
occurrences of tataat are influenced by occurrences of genes. We can draw fohowing conclusions, that 
is completely coherent with biological observations: 

• When t ^ —500 and t ^ 1000, h{t) = 0. It means that for such i's, tataat occurrences seem to 
be uncorrelated of gene occurrences. 

• When t G [— 500;0], h{t) > 0, meaning that there is a preference having a word tataat just 



before the occurrence of a gene. It corresponds to the same conclusions drawn from Figure 7 
(second point). The motif tataat is part of the most common promoter sites of E. coli meaning 
that it should occur in front of the majority of the genes. 

When t G [0; 1000], h{t) < 0; occurrences of tataat are avoided for such distances t. Genes on 
the same strand do not usually overlap and they are about 1000 bases long in average: this fact 
can explain this conclusion. 




Figure 8: Estimator h defined in Section 3.1 for E. coli data at the scale 1 : 1000, with parents=genes and 
children= tataat . 



Finally, Figure 9 presents the results of the FADO procedure [12j and Figure 10 presents the results 
of the Islands procedure of |27) . For the FADO procedure, we have forced the estimators to be piecewise 
constant to make the comparison easier. Our results agree with the ones obtained by FADO and 
Islands. But our method has advantage to point out that nothing significant happens after a certain 
distance (contrary to the FADO procedure), has advantage to treat interaction with another type of 
events (contrary to the Islands procedure) and has advantage to deal with the dependence on the past 
occurrences but also on the future occurrences (the function h is supported in M-|- for the two other 
procedures). For algorithmic reason, a practical limitation of our method is that we only consider 
piecewise constant estimators (as for the Islands procedure), but it is enough to get a general trend on 
favored or avoided distances within a point process. 
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FADO; m = 12 



FADO:m = 15 



Figure 9: FADO estimators for both E. coli datasets: left: tataat; right: genes. 



!- 



Figure 10: Islands estimators for both E. coli datasets: left: tataat; right: genes. 



5 Conclusion 

In our paper, we have investigated the dependencies between two given motifs. A random thresh- 
olding procedure has been proposed in Section 2.2. The general results of Section 2.3 have revealed 
the optimality of the procedure in the oracle and minimax setting. Our theoretical results have been 
strengthened by simulations illustrating the robustness of our procedure, despite a calibration of pa- 
rameters from a practical point of view that differs from the theoretical choice. Section 4 has validated 
the procedure with a good detection of favored or avoided distances between occurrences of tataat 
and genes along the E. coli genome. 

Further extensions of our model could be investigated. First, we could consider a more sophisticated 
model that takes into account the phenomenons of spontaneous apparition and self-excitation (as for 
the complete Hawkes model). But this model raises serious difficulties from the theoretical point of 
view. This is an exciting challenge to overcome them. Secondly, we could extend our cascade algorithm 
to general wavelet bases and not only to Haar bases. Finally, it is also relevant to study similar processes 
in the spatial framework and to connect them, for instance, to the Neymann-Scott process (see Section 
6.3 of [5]), which is a stimulating topic we wish to consider. 



Acknowledgments: The author wishes to thank Sophie Schbath for the two genomic data sets used 
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6 Appendix: Proof of [Theorem 1 



In the sequel, the values of the constants K, K' ,Kq,Ki, K2, K-^, . . . may change from line to line. For 
the sake of clarity, the proofs are fully detailed in this appendix. 

6.1 A more general result 

We first give a general result stated and proved in |26| . 

Theorem 2 (Theorem 2.2 of [26j). To estimate a countable family (3 = {(3x)x£A, such that WfSWi^ < 00, 
we assume that a family of coefficient estimators {f3\)x^r, where T is a known deterministic subset of 
A, and a family of possibly random thresholds (r?A)Aer ^i"^ available and we consider the thresholding 
rule 
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Let £ > be fixed. Assume that there exist a deterministic family (H\)\Qr cind three constants k G [0; 1[, 
a; G [0; 1] and C > (that may depend on e but not on X) with the following properties: 

(Al) For all A in T, 

p(|/3a-/3a| >^rjx) ^uj. 

(A2) There exist 1 < p,q < oo with p + ^ = 1 o-iT'd a constant R > such that for all X in T, 



E I /3a -ft 



|2p 



" !^Rmax<Hx,H^ei 



1 

'-'-'A ^ 



(A3) There exists a constant such that for all X in T such that H\ < 9e, 

P (|/3a - /3a| > f^Vx, I/3a| > V\) < HxC- 
Then the estimator (3 satisfies 

y (^ \<^m xem xem J J Aer 

with LD = ^ ((1 + 0-1/9)^1/9 + (1 + 0Vg)£i/9^i/'?) . 

Using the previous theorem, we estabhsh the foUowing result that we wih prove in Section 6.4. 

Theorem 3. Let n ^ 1, jo G N*, 7 > and A defined by l[6.15\) and l[6.16\) . Then the estimator /3 
defined by \2.8\) in Section 2.2 satisfies 

E (||/3 - /3||i) ^ Ci mf^ I E /^A + i^(jo, n, T)\m\ \ + C2i?(e-'=^^»^/2 + ^-MMrl^-^^^^ ^ 

™^ I A^m J 

where C\ is a positive constant depending on 7, \\h\\i, \\h\\oo, HV'lli; llV'lb f^d ||V'||oo) C2 is a positive 
constant depending on the compact support of h and the compact support of ip, 

/ 1 2^0/2 2^V2 n \ 

^ - ^« |- + ;^ + ;^^T7^ + ^ J' 

withCpi a positive constant depending on \\h\\i, ||/i||oo; ihe compact support ofip, ||V'||i; llV'lb o-nd llV'lloo 
and Ki and K2 are absolute constants in ]0; 1[. 

To obtain [Theorem T1 we consider n ^ 2, we take Jq the positive integer such that 2^° ^ n < 2^'^^^ 
and 7 > 2 log 2 in the previous theorem. Therefore, we note that 

{logn (logn)'^/2ni/2 (logn)"!?! (logn)^ (logn)^/2ni/2 (log n)n 1 

J logn (logn)^/2 (logn)^ (logn)^ (logn)^/^ (log n)n 1 
^ I^T^ ;^ + ^i^ + T ^ nV2Ti/2 + r2 j 

^ ^ j (logn)^ (logn2)ra | 
^ I n T2 (' 
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by comparing all the terms of the right-hand side between them (for this, we distinguish the cases 
n ^T and n ^ T), with K an absolute positive constant (that changes from line to line) and 



with K' an absolute positive constant. Moreover, e '^i-Jo')'/22-?o is bounded thanks to the choice of 7. 
Finally, since 

fh-h\\l^Ko\\^-f^\\l, 

with Kq a positive constant depending only on the functions that generate the biorthogonal wavelet 
basis, we establish [Theorem II 



6.2 Technical lemmas 

Before proving [Theorem 31 we establish two lemmas which we will use throughout the proof. 
Lemma 6.1. 

(a) For any function f in 2(1^) and for all t G M, Var,r(/(i — U)) ^ — / / (x) dx, where 
Var^(/(t — U)) denotes the variance of f{t — U) where f7 ~ vr. 

(b) For any function f in i(M) and for all t G R, / ^^^{f (t — U)) dt = / f{x) dx. 



1 f 

(c) For any nonnegative function f in i(M) and for all t G M, K.,^{f{t — U)) ^ — / f{x) dx. 

T Jm. 

Proof. 

(a) Let / G 2(K) and t G M. 

Var,(/(t - [/)) ^ EM\t -[/)) = !/ f\t -u)du^^ [ f\x) dx. 

(b) Let / G i(M) and t G M. 
E^(/(t - U)) dt = EJ [ fit -U)dt] =e( [ fix) dx] = [ fix) dx. 



I F ., . . 1 



(c) Let / G i(M) such that / ^ and t G 

EM{t-U)) = ^ I fit-u)du^^ I fix)dx. 

^ Jo -l- JR 

D 

The next result is a Rosenthal type inequality for any Poisson process, that extends Lemma 6.2 of 
]• 
Lemma 6.2. Letp ^ 1. Consider a Poisson process N on (X, Af) a measurable space, with a finite mean 

measure u : X \-^ M+ and a function c^ : X 1— )• M which belongs to 2p{'^)- We denote $ = ^i^) dNx a 



natural estimator of (3 = ipix) duix) that satisfies E(/3) = /?. Then, there exists a positive constant 
Cip) only depending on p such that 

E(|/3 - /3|2P) ^ dp) (^j^ |9^(x)|2P duix) + (Var(/3))^) , 

where YaiiP) = / c/? ix)duix). 
Jx 
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Proof. Let p ^ 1. Suppose ||(/3||oo < +c« first. As a Poisson process is infinitely divisible, we can write: 
for any positive integer k, 

k 



i=l 



where the iV*'s are mutually independent Poisson processes on X with mean measure v/k. Hence, 

k ^ k 






Yi= ip{x) [dNi - k-^dv{x)). 



where for any i, 



So the 1^'s are i.i.d. centered variables, each of them has moments of order 2p and 2. We apply the 
classical Rosenthal's inequality (for instance, see Proposition 10.2 of [15]): there exists a positive 
constant C{p) only depending on p such that 



E 



i=l 



2pN 



< 



c{p)[Y,nm''n + \Y.^iy?) 



a=l 



vi=l 



/ k \ 

Now, we give an upper bound of the limit of E > |1^| for i G {2p, 2} when k 

introduce 

^fe = {V^G{l,...,A;},iVi^l}, 

where N^ is the number of points of N'^ lying in X. Then, 

P(f]^)=P(3iG{l,...,/c},iVi^2) 



^^P(iVi^2) = A;^ 



i=l 



i>2 



j! 



^ A:(z^(X)/A;)2 = k'^v{Xf 



oo. Let us 



On J]fc, if iV| = (so / v9(x) dNi = 0), 



ll^d' = Ok{k- 



and if A'^^ = 1 (so / vj(x) dA^^. = y^{T), where T is the point of the process A^*), 



|y,r = |^(r)r + Ofc(fc-V(T)| 



£-l^ 



Consequently, 



.(g,.,') 



^E 



In, Uo,(fc-^) + j; [|(^(r)|^ + o,(fc- Vmr ^)] 



TeAf 



+ a/P(0 



1 



E 



Ei«i 



^i=l 



5.1 



Nonparametric estimation in a Poissonian interactions model 



21 



But we have 



Ei^^N2-^E 



j=i 



ifix) dNl 



+ {k-^ f \ip{x)\du{x) 
^\\lNi + k(k-^ lj^{x)\du{x)^ 



i~l II. ,11^ 7\rf 



^2 



Thus, when /c — )• oo, the last term in (|6.ip converges to since a Poisson variable has moments of 
every order and 



limsupEl Viy^n i^^f [ \ipix)fdN,) = [ 



\<p{x)f duix) 



which concludes the proof in the bounded case. 

But for any function if such that f^ \(p{x)\'^^ du{x) < +c«, the desired upper bound is finite and we 

get it by approximating (p by, for instance, piecewise constant functions. D 

6.3 Proof of ILemma 2.11 

Let A G A be fixed. G{(p\), defined by (|2.3p . is a measurable function of the observations and by 
considering the aggregated process (jl.ip . we can write 



a(v> 



E 

4=1 



77 — 1 

Mt - Ui) ^.{vx{t - U)) 



n 



dNt 



[ V Mt - Ui) dNt - (n - 1) / K^{ipx{t - U)) dNt 
Y^ I ^^{t-Ui)dNl- Yl f^AMt-U))dN^ 



E 

1=1 



/ ^x{t - Ui) dNi + y^l [px{t - Ui) - E^(v9A(t - U))] dNi 

JR ■ , • JR 



Now, we prove the first part of ILemma 2.11 We have 

nG(v>.)\u,,...,u„) 



E 

1=1 



/ ipx{t - U^)h{t -Ui)dt + y^f [ipxit - Ui) - E,(vPA(i - U))]h{t - Uj) dt 

JR ■ / • JR 



Write x = t — Ui in the first integral. Therefore, 

EiG{ipx)\Uu ...,Un) = n [ ipx(x)h{x) dx + W{ipx), 

JR 



where 



W{vx)= Y. f Wx{t-Ui)-E^{^x{t-U))]h{t-Uj)dt. 

1 ^a-L:.^„ J^ 



l^j^j<;n 
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Moreover, 



V / E[ipx{t - Ui) - E^{vx{t - C/))]E(/i(t - [/,)) dt 



0. 



Finally, 



^{G{vx)) = n I (px{x)h{x)dx, 
i.e. /3x is an unbiased estimator for f3x' 



E0x) = E 



( Gjifx) 






n 



(px{x)h{x)dx = /3a. 



It remains to control the variance of the estimator f3x- 



Var(G((^A)) = E 



G{ipx) — n / ipx{x)h{x)dx 



E[(G((^a) - E(G(99a)|^1, ...,Un) + W{^x)f] 
E{V{^x))+nW{^xf), 



where 



y((^A) = Var(G((/.A)|t/i,---,f/n). 
We start by dealing with the first term by using technics for Poisson processes. We have 

Z If, 



V'(w 



E 



Ti — 1 

Vx{t - Ui) E^{^x{t - U)) 



n 



^h{t-Uj)dt 



i=i 



/ V] I Mt - U,) + V [fx{t - Ui) - E^{ipx{t - [/))] I h{t - Uj) dt 

n „ 

Y, A{t-U,)h{t-U,)dt 

n „ 

+ 2V / y^[px{t-Ui)-E^{ipx{t-U))]^x{t-Uj)h{t-Uj)dt 

n .. 

+Y. Y.Y. [^^(* - ^^) - ^-('^^(^ - ^))] ['^^(^ - ^^) - ^-(v^aI* - u))] h{t - Uj) dt. 

7 = 1 ■^^ i^j k^j 



In the first integral, write x = t — Uj. So 



Viipx) = n [ ^lix)h{x) dx + 2 V / V [v9A(t - U^) - E^iipxit - U))] ifxit - Uj)h{t - Uj) dt 

n „ 

+ 2Z / m^ [v5A(t - Ui) - E^iifxit - U))] [ifxit - Uk) - E^i^xit - U))] h{t - Uj) dt. 
,=1 J^ i^i k^j 



5.2) 
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Each term can be computed by taking the expectation conditionally to Uj (in each sum) and we obtain 



/ ^l{x)h{x) dx + y^ I VeT [(/PA(t - Ui) - E^((^A(t - U))f]nh{t - Uj))dt 



= n iflix) h{x) dx + n{n - 1) / Var^iifxit - U))E^{h{t - U)) dt. 

Jr Jr. 

Then using equations (a) and (b) of lLemma 6.1[ we have 



(6.3) 



E{V{^x)) ^ n I fl{x)h{x) dx + "^^^ ^^ [ <fl{x) dx I h{x) dx. (6.4) 



Now, we deal with the second term by using the [/-statistics technics. However, W{(px) is a 
[/-statistics of order 2 but it is not degenerate. So we write 



Wiipx) = Wi{ipx) + W2{ipx), 



with 



Wi{ipx)= Yl j [px{t-Ui)-¥.^{^x{t-U))\¥.^{h{t-U))dt 

n „ 

= (n-l)V / [ipx{t-U,)-¥.^{^x{t-U))\¥.^{h{t-U))dt 



and 



where 



W2iv^x)= Yl 9iU^,Uj), 



g{U,, Uj) = / [ipx{t - [/.) - E.(</^A(i - U))] [h{t - [/,) - E^(/i(t - [/))] dt. 

Jr 

W2{(px) is a degenerate [/-statistics. It is easy to verify that 

E{W{ipxf) = E{Wi{ipxf) + E(W2{^xf). 

First we compute EiWi^ipx)"^)- 

E{Wi{^xf)=YaTiWiiipx)) 

= n{n - l)2Var (f [^x{t - U{) - E^{^x{t - U))\E^{h{t - U)) dt 

= n{n - l)2Var ( / ^x{t - Ui)E^{h{t - U)) dt 

i^n{n-lfE 
n{n — 1)^ 



\^xit-Ui)\E^ihit-U))dt 



< 



< 



2^2 

n{n — 1)^ 



E 



\ipx{t-Ui)\dt 



h{x) dx 



ipx{x)\dx\ I / h{x)dx\ , 



by applying inequality (c) of lLemma 6.11 with f = h. 



3.5) 



5.6) 
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It remains to compute K(W2{tp\)'^)- It is easy to see that 
nW2{vx?) = Yl ^[^(t/i, U,){giUi, U,) + g{Uj, Ui))] ^ 2 J] E [g{Uu Uj 



^ 2n{n - 1)E 



[^x{t - Ui) - E^(^A(t - U))] [h{t - U2) - E^(/i(t - [/))] dt 



We denote 'E(jjy^^^^^{g{U,V)) the expectation oi g{U,V) where U ^ tt and V ^ it are independent 
and f^{t) = fit - X). Hence, 



HW2{vxf) 

^ 2n(n - l)E(^y)^^^^ 



^ 2n(n - 1)E 



(;7,y)~7r(g)7r 



C/^5f (t)/l^(t) dt - EVr^n ( I fl{t)h^{t) dt 

IEt/~. U^'^{t)h^{t)dt] +E(^,v'W^. U^'^it)h^it)dt] 



(f7,y)~7r(g)7r 



^2n(n- 1)<^E 

^ 2n(n- l)<E((7y)^^55^ 



E 



C/~7r 



<p^{t)h^{t)dt 

ip^{t)h^{t)dt 



Ef/, 
2^ 



^\{t)h''{t)dt 



if^{t)h^{t)dt 

+ (E^uy)^ncsnUv'xit)h^{t)dt 

{U,V)r^nm ( / ip'x{t)h^{t)dt 
\Jm. 



But, 



E 



((7,y)~7r(g)7r 



<^'i{t)h''{t)dt 



< 



%yW0. ( fi^'(f{t)h^{t)dt f h^{t)dt 



Jr 
"^^ I 'Pl(.x)dx( / /i(x)dx 

-i ./» V./lB 



and 



Er 



(c/,y)~7r(g.7r ( / V^xit)h {t)dt 



by using iLemma 6.T1 So, 



E^((^^(t))E^(/i^(t))dt 



< 



- / \(fx{x)\dx / h{x)dx, 

^ ./R JR 



IE(^2(¥'a)') ^ 2n(n - 1))^ f ^l{x) dx ( f h{x) dx\ + ^ 



T 



\ip\{x)\dx\ ( / ft.(x) dx 



5.7) 



Finahy, by combining inequahties ()6.4p . (|6.6p and ()6.7p . we obtain the following control of the 
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variance of the estimator /3x : 



Var(/3A) = Var 



1 



( Gjipx) 






n 



1 



^ — / (px{x)h{x)dx + — / (fx{x)dx / h{x)dx 

2 / ^ \ 2 



+ 



n 



\ipx{x)\dx 



h{x)dx\ + 7^ / ipxi^)dx[ / h{x)dx 



By using the properties of the biorthogonal wavelet bases considered in this paper, for any A = (j, k) 
in A, we have: ||9?a||i ^ 2"-''^ max(-v/2/2, ||V'l|i) ^-^d ||(/?a||2 ^ niax(l, HV'lb)) which ahows us to get the 
purposed upper bound in lLemma 2.11 



6.4 Proof of [Theorem 31 

In the sequel, we will consider: n ^ 1, T ^ 1 and Jq = 0{n) and we will use following notations: 

Mh,i = max(||/i||i,l), Mh,oo = max(||/i||oo, 1), M^,i = max(||V^||i, ^/2/2), M^,2 = maxdlV^Ha, 1) and 
M^^oo = max(||-0||oo,\/2) (so that, for any A = {j,k) £ A, we have: Hv^aIIi ^^ '^~^^'^M^^i, W^pxh ^ M^,2 
and ||¥?a||oo ^ 2^'/2m^,oo)- 

We recall that A and M are positive real numbers such that h and ■0 are compactly supported in 
[—A; A] and in [—M; M] respectively. 

Now, to prove [Theorem 3l we applv [T"heorem 2[ and for this purpose we have to verify Assumptions: 
(Al), (A2) and (A3). 

6.4.1 Proof of Assumption (Al) 

Let A G r be fixed. Remember that conditionally to the f/j's, the expression given in (jl.ip is a 
Poisson process. We apply Lemma 6.1 of |26) : for any a > 0, with probability larger than 1 — 2e~", 
conditionally to the Ui's, we have 



G{(fx) -n (px{x)h{x) dx - W{ipx) 

JR 

where W{(px) is defined by ([63]), V{^x) = Var{G{(px)\Ui, . . . , C/„) and 



a 



^ ^/2aVi^x) + -^Biifx) 



B{^> 






ft 1 

^a(- - Ui) E^(v9a(- - U)) 



n 



Unlike B{ipx), V{(px) is non-observable (it depends on the unknown function h). This is the reason 
why, by fixing a > 0, we estimate V{ipx) by 



Vi^x) = Vi^x) + j2aViipx)B^^x) + 2,aB\^y 



where 



V'(w 



E 



Ti — 1 

Vx{t-Ui) ¥.^{^x{t-U)) 



n 



dm. 



Moreover, by Lemma 6.1 of |26j . we have also: ¥{V{ipx) ^ V{ipx)) ^ e ". So, with probability larger 
than 1 — 3e~", 



G{ipx) — n / ipx{x)h{x)dx 



a 



^ \/2aVi^x) + -^Bi^x) + \W{^x)\- 



(6.8) 
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We provide a control in probability of Wi{ip\). 

n „ 

Wiiifx) = (n - 1) V / [ifxit - U,) - E^((^A(i - U))]W.^{h{t - U)) dt. 

This is a sum of i.i.d. random variables. We apply Bernstein's inequality (for instance, see Proposition 
2.9 of [20]) to get that with probability larger than 1 — 2e~", 



a. 



with 

t;(v9A) = Var(iyi(v9A)) ^ ^^^^^^^ ( / Wx{x)\dx\ U h{x)dx 



(see inequality (|6.6p ) and 



K'fx) = ("-!) sup 

«e[0;T] 



[ipx{t -u)- E,((^A(i - t/))]E,(/i(t - U)) dt 



^ ^^ / \<f\{x)\dx I h{x)dx, 

-f JR JR 

using equations (b) and (c) of lLemma 6.11 Then, with probability larger than 1 — 2e~", 

|T^i((^a)| ^ "^^^l"""^^ / \^x{x)\ dx [ h{x) dx + ^^^^;-^ / \^x{x)\ dx [ h{x) dx. (6.9) 

^ JR JR 'J-' JR JR 

Now it remains to control W2{fx), with 

where 

giU^, Uj) = f [Mt - Ui) - E^(v9A(t - U))] [h{t - Uj) - E^{h{t - U))] dt. 



This is a degenerate [/-statistics of order 2, we can rewrite it as 

W2{fx)= Y. ^(f^^'f^i)' 

where 

g{Ui,Uj)=g{U„Uj)+g{Uj,U,). 

We apply Theorem 3.4 of [14] to W2 and —W2 (keeping the same notations of [14]): for all Eq > 
(eq = 1 for instance), with probability larger than 1 — 2 x 2.77e~°, 

\W2{ipx)\ ^ 2(1 + eof/^C^ + v{eo)Da + P{eo)Ba^/^ + ^{eo)Aa^ (6.10) 

where 

• A= \\g\\oo and by applying equality (b) of lLemma 6.11 with f = h, we easily have 

A^ 8||v3a||oo / h{x)dx, (6.11) 

JR 

• C^ = E{W2{(pxf) and with dHZ]), we have 

C^<^2nin-1)\^ f ^lix)dx(f hix)dx] +^(f \ipx{x)\dx^ ( f hix) dx] i, (6.12) 
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n-l 



L» = sup ^ E I Yl SiUi,Uj)ai{Ui)bj{Uj) : E lY.ai{Uif U 1,E Y.^j{Ujf | ^ 1 
l<:j<i<:n / \i=2 J \j=l 



n i— 1 n— 1 n. 

D = sup <! E I Y,Y.9iUi,U,)aiiUi)b,{U,) + E E 9{Uj,Ui)ai{Ui)bj{Uj 

1=2 j=l j=l i=j+l 



n— 1 



vj=2 



But, with the conditions on the a^'s and the 6j's, we have: 



71 i— 1 



E I ^aiiUif ) ^ 1,E I j;6,(t/,)=^ I ^ 1 



^ EE5(^-^J■)"^(^^)^J■(^J■ 
^ / E I ^ |¥'A(i - f/^) - E.(^A(t - U))\\ai{U^)\ ]^[Y1 H^ - ^J-) - ^-(^(* - ^))|l^j(f^J-)l ) dt 
^ I ^/ {n - l)\a.T^{^x{t - U)) E I ^ |/i(t - C/j) - E^(/i(t - C/))||6j([/j)| ) di 



< 



< 



'n-l 

r 



[ ipl{x)dxE I V /" |/i(t-;7,)-E^(/i(i-;7))||6j(?7,)|dt 



'n-l 
T 



iflix) dx E I 2 y |6j(f/,)| / h{x) dx 
\ .=1 -^^ 



^2(n-l)W- / ipl{x)dx / h{x)dx, 

V -^ ^M ^K 



using equations (a) and (b) of lLemma 6.11 Inverting the a^'s and the &j's, the same computations 
apply to the second term and we obtain 



_D^4(n-l)W— / (pl{x)dx / h{x)dx, 



(6.13) 



j-i 



B = max < sup I > K{Q{u,Uj) ) I ,sup I > E(^(?7j,u) ) I > and since Q is symmetric, we 



vi=i 






have: 



j-i 



52 = sup YE{g{u,Uj 

u,i \ • 1 

V=i 

/n-l 

= sup yE(g(u,C/j 



^n-1 



^2sup I y [Eig{u,Ujf) + EigiUj,uf)] 
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But, 



= E 

^ E 
^ 2E 



[^x{t -u)- E^{ipx{t - U))] [h{t - Uj) - E^ihit - U))] dt 



[i^xit -u)- E^iifixit - U))]^\h{t - Uj) - E^{h{t - U))\ dt / \h{t - Uj) - E^{h{t - U))\dt 



h{x) dx 



[ipxit -u)- E^{ipx{t - U))\^\h{t - Uj) - E^{h{t - U))\ dt 
^^ f [^x{t-u)-E^{ipx{t-U))fdt(jh{x)dx 



and in the same way 



E{g{Uj,uf 
= E 

^E 

^ 2E 
2 



[pxit - Uj) - E^ifxit - U))] [h{t -u)- E^{h{t - U))] dt 



[ipx{t - Uj) - E^{ipx{t - U))f\h{t -u)- E^{h{t - U))\dt / \h{t - u) - E^{h{t - U))\ dt 



< 



T 
4 



[ipx{t - Uj) - E^{^x{t - U))]'\h{t -u)- E^{h{t - U))\ dt 
\h{t -u)- E^{h{t - U))\ dt / Lp\{x) dx / h{x) dx 



h{x) dx 



^ f I 'Pl(.x)dx[ I h{x)dx 



by using [Lemma 6.ll Hence, 



2- ^^^"^ ^' I ^l{x)dx[ I h{x)dx 



(6.14) 



Finally, by inequalities ([63]), dES]) and (f6J0]) combined with (IHTTjl . IKW\ . (l6J2l) and (f633]l . we 
obtain: for any eo > 0, with probability larger than 1 — (5 + 2 x 2. 77)6"°^, 



|^,_,,K/,„,(^),|«(^ 



^/2 



oin ,, ,, 2a,, ,, , ,o/9 , — /I,, ,,„ 1 ,, ,,„ 

J, \vx\\i + ^IIv^aIIi + 2(1 + eQ)^''V2a^-\\^x\\l + 7^\Wx\\l 



+ 4.r]{eQ)a\l -\\^x\\l + P{eQ)a'^'^ ^ —\\<fx\\l + -l{eQ)a^\Wx\ 



^ 



I 2M^,i^ + 2^/2M^,i^ + 2^/2(1 + eo)'"^M^^2^ + 4(1 + sof'^M^^^^ 

a n- o?l'^ 2^0/2^2 1 

+ 47?(eo)M^,2^= + V40/3(eo)M^,2^= + 87(^0)^^,00 \\\Hu 

JT VnT n 
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because 2^-''^ ^ \/2 if —1 ^ j ^ jo- We denote b the quantity between braces above. 

This upper bound depends on h (via \\h\\i) and this potential threshold could not be used for 
applications because h is unknown. So we overestimate \\h\\i by - — ^° "^ and we have a threshold that 
does not depend on h. So, for any value of k g]0; 1[, by fixing a = k'^JoJ with 7 > 0, we define for all 
Ainr, 



where 



A = (1 + eo) 2M^,i^ + 2^/2M^,i|p + 2^/2(1 + sof/^M^^^f^ + 4(1 + eof/'M^,^^ 
+ 4r/(£o)M^,2^ + V40/?(£o)M^,2 ° ^ + 87(^0)^^,, -^^ ^ ^^ 



Thus, for all A in T, 

P(|/3a-/3a|>^^a(7,A)) 



V V Vn/3\n/ n n 



+ F f I/3a - /3a| > j2aV (^) + ^i? (^) + b^l±^^^ , ii±i^^ ^ \\h\ 



^ (5 + 2 X 2.77)e-" +P ("^i^t^^ ^ \\h\u) , 



^' + ^°^^^^INhVFfiVM-n|Hh^ ^°"""'""^ 



with 



n y \ 1 + eo 

^ exp(-5r(eo)ra||/i||i), 

using Proposition 7 of [25] with g{eo) = j^ (log j^^^ 1 j +1. 

2 ■ 

Therefore, Assumption (Al) is true if we take 00 = (5 + 2 x 2.77)e~'^ ■'"'^ + exp (— (7(eo)?i-||^||i), with 
7 > and £0 > 0. Furthermore, the threshold (12. 4p that lies at the heart of the paper is achieved by 
rewriting A by grouping the constants into one: 

A = d(7, llVlli, llV'lb, ll^lloo) y-^ + ^ + ^1 (6.15) 

with 

(i(7,||VI|l,IIV'l|2,||V'l|oo) 



= (1 + £o)l 2V7M^,i + ^7M^,i + 2^2(1 + Sof^^V^M^^^ + 4(1 + Sof^'^V^M^,! 

+ 4ry(eo)7M^,2 + \/40/3(eo)7^^^^^,2 + 87(60)7^^^,00 [, 
where /3(eo), 7(eo) and r/(eo) are defined in [H] with eq = 1- 



(6.16) 
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6.4.2 Proof of Assumption (A2) 
Let A G r be fixed. For any p ^ 1, 



E(|/3a - (3x\^n = E 



G{ — )— / ipx(x)h{x)dx 
n ^ Jr 



2p> 



^E (\G{ipx) - E{G{ipx)\Ui, ...,[/„) + W{^x)f'') 



€ 



n 

22p-l 



n 



2p 



[n\G{^x) - nG{^x)\ui, ..., Un)\^n + Ei\wiipx)\^n] ■ (6.17) 



Now, let us give an upper bound of each term of the right-hand side of the previous inequality. 
We first study the first term of ()6.17p . We have: 

EilGi^x) - EiGiipx)\Ui, . . . , Un)\^n = E[E(|G(<^a) - EiGiipxWu ..., [/„)pP) | C/i, . . . , [/„)] 



and conditionally to the C/j's, A^ is a Poisson process. We apply [Lemma 6.2] for any p ^ 1, there exists 
a positive constant C{p) only depending on p such that 



E(|G(v?a) - E(G(v9A)|f/i, . . . , Un)\''n \Ui,...,Un) 



^C(p) 



E 



Tl — 1 

V>x{t - Ui) E^(v9A(t - U)) 



n 



2p „ 

Y^h{t-U,)dt + V{ipxf\ . 



(6.18) 



On the one hand, we provide a control in expectation of the first term of ()6.18p . We have: 

2P n 



E 



H[E 



ft \ 

Vx{t - Ui) E^{^x{t - U)) 



n 



j=i 



^h{t-Uj)dt 

2p 



^A(t - U,) + Y, [Px{t - Ui) - E^{^x{t - U))] 



i¥=j 



h{t-Uj)dt 



^ 2^P- 



EijY,\Vx{t-Uj)\^Ph{t-U,)dt 



+H4E 



Y,[Mt-U)-E^{Mt-u))] 



2p 



h{t- Uj)dt 



with 

El / V |^A(t - Uj)\'^Ph{t - Uj) dt] =n [ 



M^)\^''Hx)dx^n / ^lix) dxyxf£ 



and 



^I4S 



Y,[Mt-U)-E^{ipxit-U) 



i¥=j 



2p 



h{t- Uj)dt 



n E 



Y,[Mt-Ui)-E^{ipx{t-U))] 



^¥=3 



2p\ 



E{h{t-Uj))dt 



n-l 



Y,[Mt-U)-E^{ipx{t-U))] 



i=l 



2p\ 



E^{h{t-U))dt. 
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By applying Rosenthal's inequality, there exists a positive constant C{p) only depending on p such 
that 



E 



^ 



n-l 



2p\ 



Y^ [ifxit - Ui) - E^i^xit - U))] 

i=l 

dp) ((n - 1)E (|(^A(t - C/i) - E.((^A(t - U)f'') + {n- lY[\^T^{ipx{t - U))]") . 



But, 



E (|(^A(t - f/i) - E^{^x{t - U)f') ^ K (E.(|(pA(i - U)\''n + |E.((^A(t - C/))|'^) 



1 



T 



^K \- j \^x{t-u)\''Pdu + 



ip\{t — u) du 



2p\ 



^K - 



(iX.i,.,..li..ii-.^(/_ 



\ 2pN 

(px{x)\dx\ 



with K a positive constant only depending on p and using inequality (a) of lLemma 6.H 



Yar^i^xit - U)) i^ - / ipl{x)dx. 



Thus, 



E 



E 






2p 



h{t-Uj)dt 



rr I ^ J- II ii9ii iiO-n— 9 ^ J- 

^niT ( ^^II^aIIsII'/'aII^ ' + 



l|2p , (?^-lF || ||2p 
2-2p ll'^^lll + ^p Il'^^ll2 



^ R- / '^^l l|2|| ||2p-2 , n"^ II ||2p , "-^^^1 ||2p 
^K ( yII'/'a||2|I'/'a||o£ +j^\\f\\\l+^j^\\'Px\\2 



E^{h{t-U))dt 



using equation (b) of lLemma 6.11 

Therefore, we have the following control of the first term of (|6.18|) 



E 



^K 



E 



77 — 1 

^xit-Ui) E^{^xit-U)) 



n 



2p 



^/i(t-C/j)(it 



n\Wx\\l\Wx\?£-'\\h\\oo 



'i 11 1 1 2 1 1 II In 2 ^ 1 1 



w .. ,,o,, ,,o r, n" ,, ,,2p n^"*""^ 



2^2pl 



TP 



l|2p 



(6.19) 



Now let us provide a control in expectation of the second term of (|6.18p . \.eV{ipxY- First, we recall 

that V{ipx) = \a.i{G{ipx)\Ui, . . . ,Un) and we remark that E{V{^xY) < \^{y{^x?^)Y''^ (using the 
Cauchy-Schwarz inequality). So, we focus on the moments of V{ipx) of any order m ^ 2. 
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Let m ^ 2. According to the expression (|6.2p of V{<p\), we have: 



n I ^l{x)h{x) dx + 2 V / V [ipx{t - Ui) - ¥.^{ipx{t - U))\^x{t - Uj)h{t - Uj) dt 

n „ 

+ Y. 12J2 ['^^(^ - ^^) - ^-('^^(^ - ^))] ['^^(^ - ^k) - ^Avxit - u))] h{t - u,) dt 

i=l ■'^ ij^j k^j 

n I iplix)h{x) dx + 2 V / V [^x{t - Ui) - E^iipxit - U))]E^iipx{t - U)h{t - U))) dt 
Jr .^^ Jr ^^. 

n „ 

+ 2 V / V [pxit - Ui) - E^i^xit - U))] [ifxit - Uj)h{t - Uj) - E^{y,x{t - U)h{t - U))] dt 

n .. 

+ Y. Y.Y1 [^^(* - ^^) - ^-('/'A(t - U))] [ipxit - Uk) - E^iifxit - U))]E^{h{t - U)) dt 

j = l -^^ ijLj k^j 
n „ 

+ Y. Y.Y1 ['^^(^ - ^^) - ^-('/'A(t - U))] [ifxit - Uk) - E^iifxit - U))] 

i=l ''^ ijkj k^i 



X [hit - Uj) - E^ihit - [/))] dt. 



This formula provides a decomposition of Viipx) in a sum of degenerate [/-statistics of order 0, 1, 2 
and 3. Indeed 

Vi^x) = mi^x) + Wi((/^a) + W2(v?a) + W3((/^a), 
with Wii^px) is a degenerate [/-statistic of order i defined as follows: 



W3(<^> 



Y, I [^xit - Ui) - E^i^xit - U))] [ifxit - Uk) - E^i^xit - [/))] 

l^^i^j^k^n-''^ 

X [hit - Uj) - E^mt - [/))] dt, 



W2iipx) 



'^ Y. I ['^^(* - ^^) - ^-(V'aI* - U))] [ifxit - U,)hit - Uj) - E^iifxit - U)hit - [/))] dt 
+ (n - 2) Yl [ [^A(i - U^) - E^iifxit - [/))] [^xit - Uk) - E^i^xit - U))]E^ihit - [/)) dt 



l^i^^fc^n" 



+ E 



[ifxit - Ui) - E^ifxit - U))f - Yar^ifxit - U)) 



[hit - Uj) - E^ihit - [/))] dt, 



^^lifx) 



n .. 

: 2(n - 1) V / [ifxit - Ui) - E^if^xit - U))]E^iifxit - U)hit - [/)) dt 
i^^jR 

+ (n - 1) V / [ifxit - Ui) - E^ifxit - [/))] ' - Var^((^A(t - U)) 

n „ 

+ (n - 1) V / Yar^iifxit - U)) [hit - Uj) - E^(/i(t - [/))] dt 
j=iJ^ 



E^ihit-U))dt 
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and 



Wo{f\) = n I ipi{x)h{x) dx + n{n - 1) / \aT^{(fx{t - U))E^{h{t - U)) dt 

nln — 1] 



^{V{ipx)) ^n / ^i{x)h{x)dx + 



ipi{x)dx / h{x)dx, (6.20) 



by using ()6.3p and ([67 

First, we are interested in the moments of Wi((/7a) that we write: 

Wi(vja) = m,i{fx) + Wi,2(^a) + >Vi,3(v'a), 
with: 

n .. 

• Wi,i(v?a) = 2(n - 1) V / [px{t - Ui) - ^A^x{t - U))\¥.^{^x{t - U)h(t - U)) dt. 



We have: 

E(iwi,i(<^A)r) 

= 2™(n-l)™E 



^2™(n-l)™ xC{m)\ nE 



n „ 

V / [px{t - Ui) - E^((^A(t - C/))]E^(^A(i - U)h{t - U)) dt 

[ifxit - Ui) - E^(v9A(t - f/))]E^(v9A(t - U)h{t - U)) dt 



+ n 



m/2 



Var / [ipxit - Ui) - E^((^A(i - U))]¥.^{ipx{t - U)h{t - U)) dt 



m/2 ' 



using Rosenthal's inequahty, where C{ni) is a positive constant only depending on m. But, applying 
ILemina 6.1\ 



E / [ipx{t-Ui)-E^{^x{t-U))]E^{vx{t-U)h{t-U))dt 

\ JR 

^ ^ ( / l'^^(^)l^^) i \v\{x)\h{x)dxj 

and 

Var (f [<fxit - Ui) - E^{^x{t - U))\E^{^x{t - U)h{t - U)) dt 
= \&t{ I ifxit - Ui)E^{ipx{t - U)h{t - U)) dt 



^ E 



\^x{t - Ui)\E^{\^x{t - U)\h{t - U)) dt 



^J^\ I \^\{x)\dx 



\(px{x)\h{x) dx ) . 
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So, 



IE(|Wi,i((^A)r) ^ 2-(n - ir X C(m) 



^i^i 



^Ki, 



n 



3m/2 



^3m/2 



2™n 

m 



Lpx{x)\dx\ ( / |c/?A(a;)|/i(x)(ix 
(/?A(a;)|(ix ) ( / \(px{x)\h{x) dx 



with Ki 1 a positive constant only depending on m. 



ipx(x)\dx\ i \ipx{x)\h{x) dx 

2m 

^x{x)\dx' ii^ii- 



(6.21) 



n „ 



[(^A(i - C/i) - ^AMt - U))]^ - Var^(<^A(t - U)) 



E^{h{t-U))dt. 



We have: 

iE(iwi,2(v^A)r) 

= (n-irE( X; 
< n™ X C(m) ( TiE 

m/2 



[ifxit - Ui) - E^((^A(i - U))f - Var^(<^A(t - U)) 



E^{h{t-U))dt 



[ifxit - Ui) - E^(vPA(i - U))f - Yav^iifxit - U)) 



E^{h{t-U))dt 



+ n" 



Var 



[^xit - Ui) - E^{ipx{t - [/))]' - Var^(^A(i - U)) 



E^{h{t-U))dt 



m/2 ' 



using Rosenthal's inequality, where C{m) is a positive constant only depending on m. But, applying 
ILemma 6.H 



E / [(^;,(t-C/i)-E^(<^A(t-t/))]'-Var^(vPA(i-f/)) 
V JK L 

^ Ki2 ( / ip\{x)dx\ ( / h{x) dx 
\Jr J \Jm. 

with Ki^2 a positive constant only depending on m and 
Var 



K^{h{t-U))dt 



[ifxit - Ui) - E^iifxit - U))] ' - Yai^i^xit - U)) 
Var (j [pxit - Ui)-E^{^x{t - U))\\^{h{t - U))dt 



E^{h{t-U))dt 



CE 



^j^^ 



[ifxit - Ui) - E^iifxit - U))] ^Ejh{t - U)) dt 



[fx{t-Ui)-E^{ifx{t-U))\''dt 



h{x) dx 



<K,._l,[JA(.u.)\l 



h{x) dx 
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So, 



iE(iwi,2(v^A)r 



^ Ki,2 n 



n 



^K, 



n 



T 

3m/2 



^\{x) dx 



/ h{x)dx\ + 



m ^m/2 



ip^{x)dx\ I / h(x) dx 



'J'r, 



'^\{x) dx\ { I h{x) dx 



(6.22) 



n „ 

Wi,3((^a) = (.n-l)y2 Var^(99A(t - U)) [h{t - Uj) - E^(/i(t - U))] dt. 



We have: 

E(iwi,3(v^A)r 

= (n - 1)"E 

^ (n - 1)"" X C(m) ( nE 



n .. 

V / Var^(^A(t - U)) [h{t - Uj) - E^{h{t - U))] dt 

,=1 -^K 

Yar^iipxit - U)) [h{t - Ui) - E^{h{t - U))] dt 
Var ( / Vaiv(<^A(t - U)) [h{t - Ui) - E^(/i(t - U))] dt 



+ n 



m/2 



m/2 ' 



using Rosenthal's inequahty, where C{m) is a positive constant only depending on ni. But, applying 
ILemma 6.H 



E 
and 



Var^((^A(i - U)) [h{t - Ui) - E^(/i(t - U))] dt 



m\ r,m 



< 



J^ri 



^x{x) dx \ I / h{x) dx 



Var ( / Yar^i^pxit - U)) [h{t - Ui) - E^{h{t - U))] dt 



Var / Yai\{ipxit - U))h{t - Ui) dt 



^E 



< 



Var^(v?A(i - U))h{t - Ui) dt 

2 

' h{t- Ui) dt 

2 



ipi{x)dx] E 

2 



1 

^ ^ ( / ipi{x)dx ) ( / h{x)dx 



So, 



IE(|Wi,3(v^A)r)^(n-irxC(m)| 



2™n 



(fxi^) ^^ ( / ^(^) ^^ 



+ 



n 



m/2 



^K 



n 



3m/2 



1,3 



(Px{x) dx 



(Px{x) dx 
h{x) dx 



h{x) dx 



(6.23) 
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with Ki^s a positive constant only depending on m. 

Next we deal with the moments of W2(va) that we write: 

W2(9?a) = W2,i((^a) + W2,2(93a) + >V2,3(V^a), 



with: 



>V2,i((^a) = 2 ^ /" [Mt-U^)-E^{Mt-U))] [Mt-Uj)Ht-Uj)-E^{ipx{t-U)h{t-U))] dt. 



l^ij^j^n 



We want to use Theorem 8.1.6 of [6] (a moment inequality for [/-statistics using decoupling) so we 
write: 

W2,i((^a) = 2 Y1 fiUi,Uj), 
where 



f{Ui, Uj) = / [ipxit - Ui) - E^(v9A(t - U))] [fx{t - U,)h{t - Uj) - K^{ipx{t - U)h{t - U))] dt. 



There exists a positive constant C2,m depending on m only such that 



^C2,^n™E(|/([/i,C/2)r) 



^ C2,mn"'E 



^ C2,„n™E 



xE 



[ipxit - Ui) - E^i^xit - [/))] [Mt - U2)h{t - U2) - E^{^x{t - U)h{t - U))] dt 

\ m/2' 

[ipxit -Ui)-E^{^x{t-U))]^dt] 

^ m/2 

[Mt - U2)Ht - U2) - E^i^xit - U)h{t - U))fdt 



m/2 / /> \ m/2 

^ K2,i n" ( / ipl{x)dx ] ( / !pl{x)h^{x)dx 



^^2,1 n™ / ipl{x)dx 



m 

00 ' 



by applving ILemma 6.11 and setting i^2,i a positive constant only depending on m. So, 



E(|W2,i(/)r) < ^2,1 n^ / A{x) dx] \\h 



(6.24) 



We may write: VV2,2(va) = ("- ~ 2) \^ f{Ui, Uk), where 



fiUu Uk) = / [ipxit - Ui) - E^{ipx{t - U))] [ipxit - Uk) - E^i^xit - U))\E^{h{t - U)) dt. 



Nonparametric estimation in a Poissonian interactions model 



37 



We use Theorem 8.1.6 of [6]: there exists a positive constant C2,m depending on m only such that 

^C2,mn"'E{\f{Ui,U2)\ 



^ C2.mn"'E 



[^x{t - Ui) - K^iifxit - U))] [^x{t - U2) - ^Avxit - U))]E^{h{t - U)) dt 



^K2 2 < E 



[ipx{t-Ui)-E^{ipx{t-U))]^dt 
^ K2,2 j^il vl{x) dxj I h{x) dx 



m/2' 



h{x) dx 



by applying ILemma 6.11 and setting K2.2 a positive constant only depending on m. So, 



E(|W2,2(/)r) ^ K2,2 



n 



2m 



^xi^) ^x ] ( / ^(^) '^^ 



(6.25) 



We may write: W2,3(v?a) = /^ f{Ui,Uj), where 



fiUi,Uj) 



[^t - Ui) - E^i^xit - U))] ' - Var^iipxit - U)) [h{t - Uj) - E^(/i(t - U))] dt. 



We use Theorem 8.1.6 of [6]: there exists a positive constant C2,m depending on m, only such that 



E 



E fiU^,UJ) 
^C2,mn"'E{\f{Ui,U2W 



-'2,m' 



'-2,3 



[[^xit - Ui) - E^i^xit - U))f - Yai^iifxit - C/))][/i(t - f/2) - E^(/i(t - [/))] dt 



[ifxit - Ui) - E^{^x{t - U))] ' - Var^((^A(i - U)) 



dt 



^K2,zn'^[ / 'fl{x)dx] \\h 



by applving ILemma 6.11 and setting -ftr2,3 a positive constant only depending on m,. So, 

E(|>V2,3(v^A)r) ^ i^2,3 n™ f / ^l{x)dx 



\m 

loo' 



(6.26) 



And finally, we focus on the moments of W^i^x) that we write: Wzifx) = / f{Ui,Uj,Uk), 

where 



f{u^,a 



,Uk) = [ [ipx{t-U^)-E^{Mt-U))] [ipx{t-Uk)-E^{ipx{t-U))] [h{t-Uj)-E^{h{t-U))] dt. 

JR 
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We use Theorem 8.1.6 of [6]: there exists a positive constant C^^m depending on m only such that 

^C3,™n3™/2E(|/(C/i,t/2,C/3)r) 



^C.^n^-Z^El 



[pxit - Ui) - ^^{ifxit - U))] [ipxit - U2) - E^(99A(t - U))] 

X [h{t - Us) - E^{h{t - U))] dt 



^ K3 n3™/2E 



[ifxit - Ui) - E,((^A(t - U))] [ipxit - U2) - E,(vPA(i - U) 



dt 



m 

001 



by applying ILemma 6.11 and setting K^ a positive constant only depending on m. Furthermore, usin^ 
the support properties of the biorthogonal wavelet bases considered in this paper, we have 



E 



{ I [vx{t - Ui) - E^(^A(t - U))] [ipx{t - U2) - E^(^A(t - U) 



dt 



E 



Vxit - Ui)^x{t - U2) - ifxit - Ui)E^{vx{t - U)) - ifxit - U2)E^{vx{t - U)) 

+ [E^{fx{t - U) 



dt 



^K.lE 



ipx{t - Ui)ipx{t - U2) 



dt 



■E 



^x{t - Ui)E^i^xit - U)) 



dt 



with: 

E 



ifxit - Ui)ipx{t - U2) 



dt 



^ Jo Jo 



< 



1 



T 



dui 



+ E 

dU2[ 

UI+2M 



U-, -2M 



[E^{ipx{t-U))fdt 



fxit -Ui)ipxit-U2)\dt 

du2 ( / ifxi^) dx 






E 



^x{t-Ui)E^{^x{t-U)) 



dt 



E 



2^m 



1 



T 



fxit-Ui)(^l ipx{t-u)du 



dt 



T 



du I \Lpx{i — Ui)ipx{t — u)\dt 



€ 



< 



^ 



1 



T 



T 



j.,„+i ; duA du I \ipx{t-ui)^x{'t-u)\dt 



1 



{AMY 



du\ 



U1+2M 



U1-2M 



du / ipx{x) dx 



ifxix) dx 
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and 



/ 

Jr 



[ [E^i^xit - U))]'^ dt = ^ [ dt [ du^x{t-u) I dvifxit-v) 

R -^ JK JO Jo 



dv I ipx{t — u)ipx{t — v) dt 



T^Jo '" 







^ 



2^2 



du 



U+2M 



U-2M 



dv / ifx (x) dx 



So, 



H\m{vx)n^K',n^^/^ 



^ -^ / V^x{x)dx. 

-^ Jr 



— I / (pl(x)dx] + — — 



(/';^(x) dx 



(6.27) 



with ETg a positive constant only depending on m and the compact support of tj). Note that if we had 
used the same method as for the control of the moments of W2^2{'^\), we would not get the correct rate 
of convergence. We obtain a better rate of convergence thanks to the properties of the biorthogonal 
wavelet bases used here. 

Thus, combining inequalities (lOOl) . (lOTT) . (lOl) . (lOHll . (lOll . (lOSi) . (IOGD and (fOTll yields 






+ 






„2m 
m _|_ ^^^^ 



C/9_;^(x) dx 



+ 



j^3m/2 



+ 



n 



2m 






|c/9A(a:)|(ix 

C/3;^(x) dx 



2m 



n' 



3m/2 



oo "'" Jim 



/i(x) dx 



^xi^) dx 



h{x) dx 



h{x)dx\ + n"^ [ (pl{x)dx 

h{x) dx] +n"'- [ I ip\{x) dx 



^( / Vl{x)dx] + 



^[f/x{^)dx) 



^K 



n 



3m/2 



IV^Alll 



2m 1 1 L lim 



+ 



n 



2m 



IV'Alb 



2m 1 1 1, lim 



+ 



n™ + 



n 



3m/2 



2m 1 1 J, 1 1 m I 



I , „ II 2m 1 1 L 



with X a positive constant only depending on m and the compact support of ijj. So, we obtain 



I n3p/2 



n 



2p 



jp 



00+ j^ll¥'A||^^||/^||? + 



nP + 



n' 



3p/2 



IV'Aliril/^IISo , 



(6.28) 



with K a positive constant only depending on p and the compact support of if). 

To conclude for the first term of (|6.17p . using inequalities (|6.19p and ()6.28p in ()6.18p . we have 



E(|G((^A)-IE(G(</.A)|^i,...,^n)|'^ 

,2 _ p+1 



^ K 



n II ||2p 



1 + 



n'^ 



+ 



TP 

2P||/,I|P 



l|2p 

fxh 



I '^ II II 2 II ||2p-2||i,|| I II II 2 II l|2p-2| 
1 + — llc^AlbllV'AlloS ll^lll+'^IIV'Alblly'AlloS ll-lloo ^^_^ 

^ (6.29) 



ri3p/2 2p 

II l|2pi|i IIP I " II i|2pi|i IIP I 

^^^II'/'aIIi ||/i||So + j^llv'Alla l|/i|li + 



nP + 



n 



3p/2 



I^^aII^II/^IISo 
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Now, we have to focus on the second term of (|6.17p . Recah the definition ()6.5p of W{(p\) 

W{ipx)=Wi{ipx) + W2{ipx), 



with 



and 



where 



So, 



n „ 

Wiiifx) = {n - l)y] / [vx{t-U^)-E^{ipx{t-U))]E^{h{t-U))dt, 



W2{^x)= Yl 9iUi,Uj), 



g{Ui,Uj) = [ [ipxit - Ui) - E^i^xit - U))] [h{t - Uj) - E^{h{t - U))] dt. 

E{\w{ipx)fn ^'2^''-'M\wi{^x)fn +H\w2{vx)fn]- 



On the one hand, we have to control E(|VFi((/3a)P^)- We use Rosenthal's inequality: there exists a 
positive constant C{p) only depending on p such that 



Ei\w,i^x)\'n 

= {n- ifm 



n .. 

V / [px{t-Ui)-E^{ipx{t-U))\E^{h{t-U))dt 



2p\ 



^ (n - ifv X C{p) nE 



[px{t - Ui) - E^i^xit - U))]E^{h{t - U)) dt 



2p\ 



+ nf 



Var / [ipx{t - Ui) - E^{^x{t - U))\E^{h{t - U)) dt 



But, applving ILemma 6.H 



E 



/ [px{t - Ui) - E^{y,x{t - U))]E^{h{t - U)) 



dt 



2p\ 






< 



22p 



[ipx{t-Ui)-E^{ipx{t-U))]dt 



I 99a (2;) I dx I I / h{x) dx 



2p\ 



\ 2p 

h{x) dx I 



and 



Var (f [^x{t - Ui) - E^{^x{t - U))\E^{h{t - U)) dt 



Var / ipx{t - Ui)E^{h{t - U)) dt 



^E 






^x{t-Ui)\E^{h{t-U))dt 
2' 



\^x{t-Ui)\dt 



^ ^ ( / \Lpx{x)\dx] ( / h{x)dx 



h{x) dx 
2 
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So, 



E{\Wi{ipx)fP) ^K^n 



2p 



^^i, (/j^.WMx) % I, (/^ 



\ 2p- 

\ipx{x)\dx] 



\ 2p 

h{x)dx] , (6.30) 



with Ki a positive constant only depending on p. 

And on the other hand, we have to control E(|W2(v^a)P^)- We have: 

We use Theorem 3.3 of [11] associated with Theorem 1 of [7j (we keep the same notations of [IJJ). We 

set /ij ,• = < , . and we consider ([/■ ,i = 1 . . .n) and ([/• ,i = 1 . . . n) two independent 

[ g otherwise ' / v i 

copies of {Ui,i = 1 . . . n). With Theorem 3.3 of [IJj, there exists an universal constant K such that 

^ K^P[i2p)PC^P + (2p)2Pi?2p + {2pfPB'^P + {2p)*PA^P] 



E 






(1) rr(2)^ 



where 



A = max ||/ijj||oo = ll^lloo- But, for all (x,y) G 



*j 



15(2^, y)l 



[(/JA(i - x) - E^(99A(i - U))] [h{t -y)- E^(/i(i - [/))] dt 
^ 4 / |c^A(2;)|d3;||/i||oo, 
using equality (b) of lLemma 6.11 with f = ^x- So, 

^ ^ 4 / |v?a(2;)| d3;||/i||oo, 

Jr 

C2 = J]E(/i2^.(C/,(^),C/f )) = Y,Ei9\U,,Uj)). But, for all z^j, 



(6.31) 



E(5'(C/„^,-)) 



^ E 



[pxit - U,) - E^{^x{t - U))] [h{t - U,) - E,(/i(t - U))] dt 



^ E 
^ 2E 



[^x{t - Ui) - E^{^x{t - U))f\h{t - Uj) - E^{h{t - U))\ dt / \h{t- Uj) - E,(/i(t - U))\ dt 

I [ipxit - Ui) - K^{ipx{t - U))]^\h{t - Uj) - K^{h{t - U))\ dt I h{x) dx 
Jr J Jr 

^^ fE([^x{t-U,)-E^{^x{t-U))]^) dt( f h{x)dx 



^Tf (/3i(x)(ix ( / h{x)dx 

^ Jr \Jr 

using [Lemma 6.II So, 



C'^^^^^^ f^l{x)dx{ I h{x)dx] , 



(6.32) 
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B = max 



Y,MhUU„y)) , ^E2{hlix,U,)) 



with 



otherwise ^ T 



MhUu.,y)) = \ -^^^^^'^^^ -ZL. ^^1 ^li-)d^{ I mdx 



and 



MhUx,U,)) 



''n"""o.Lt4e<T/-50-^)-(/''(^)- 



using estabhshed inequahties to get equation (|6.14p in the proof of the assumption (Al). So, 



5'^^ Ul{x)dx(j h{x)dx\ , 

-^ Jr \Jr J 



(6.33) 



By using estabhshed inequahties to get equation ()6.13p in the proof of the assumption (Al), we 
obtain 



D ^ 2ni — / ip'j^{x) dx I h{x)dx. 



(6.34) 



Moreover, we use the equivalence of Theorem 3.3 of [11] and the decouphng inequahty provided in 
Theorem 1 of [7j to obtain the fohowing upper bound of E(|W^2(y'A)P^)- 



n\w2{vx)?n^K2 



w(/.''5o.o^^y(/.'.w<^^)* + ^(/. 



P / j- \ 2p 

V9^(x) dx\ ( / h{x) dx 



\ifx{x)\dx 



2p 



(6.35) 



with K2 a positive constant only depending on p. 



Finally, by using inequalities (|6.29p . (j6.30p and (|6.35p in (|6.17p . we obtain: 



E(|/3a - /3a 



2p\ 



^ K 



1 



^2p-2'J^2p 



I'/'aIIi 



1 + 



1 



nP-^TP 



vxtnhh + 



1 



n2p-2j^ 



\vx\\lyx\\'£-'\\h\\i 



+ ^yx\\iyx\\'£-'\\h\\oo+ ^ 



^2p-l 
1 



ri' 



p/2rpp 



1 



|2P||/,J|P ^_L| 

TP 



|2P||/,||P 



i¥'Aiirii^iiSo + ;7^ii<^Aii2ni/^iri 



1 ~ II 1 1 2p 1 1 1 1 1 n 
+ — lbA|l2 l|/i|ISo 



,P/2T 



n- 



W\\\2 



2P||/,I|P 



+ 



"■ II ||2p||,||2p , "-' II ||2p 



r2p 






2p 

1 



1 



|2P||J,||2p 



+ ^llv'Albni/illi +;^^l 



|2P||J,||2p 



IV'A 



|2P||AI|2p 



^xh \Mi +-^n^xni ii-iloo 
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and so, 






|2||^||1/P 



n 



2||i,l|2 



1 n^-^/PT^ 
+ 



'/'A||f||/i|ir'^ + ^IIV'A||tl|/i||! + ;;Y 



7757r;||(/7A||?||/l||oo + — ||¥'a||?||/i||L 



^l_l/p^ llV^A||i||/i|lI^^ + ^||V^A||2l|/i||l + ^ll'/5A||i||/i||? + 



1 



1 

n ^ nV2TVp 



IV'Allil 



+ 



1 



+ 



1 



n 



2-2/p'J'l/p „2-l/p 



lw^ll^/''llw^lp-2/P||/,||l/P I 
I'PAlb llrAlloo ll^lloo p 



with X a positive constant only depending on p and the compact support of ijj. 
Recall that for any A = (j, A;) S A, we have: 

li ^ 2^J'/2m 11^^112 ^ M^o and ||(^a||oo ^ 2^'/2m, 



I 93a 



■i/),oo 



We consider 1 < p < oo and we fix 1 < (7 < oo such that — | — = 1, so that 



^{\h-Px?'' 






l/,r.-ni/" + ^l|/^lll + ^l 



+ 



1 



1 



+ 



1 1 

+ 



2^°'nh\\il^ 



with K a positive constant depending on p, ||'(/^||i, irv'||2) iiv'iioo 
Finally, choosing p = 2, Assumption (A2) is fulfilled with 



and the compact support of ^. 



R = Cr{- + 



1 2-'°/^ 2^"^'^ 



+ 



+ 



n 



n n 



3/2 nX^/2 ' ^2 (' 



where C/j is a positive constant depending on ||/i||i, iKiHoo, H'A'llij 
of V, 

Hx = lAer 

and 8 = 1. 



and the compact support 



6.4.3 Proof of Assumption (A3) 

To shorten mathematical expressions, we denote rjx = ??a(7) ^) hi the sequel. The following inequality: 

P (|/3a - /5a| > ^Vx, I/3a| > Vx) ^ HxC 
is obvious with (^ = uj, which proves Assumption (A3) choosing 9 = — ^. 

6.4.4 Completion of the proof of [Theorem 31 

'"'|/3aI>'?a- 



Therefore we can applv [Theorem 2] the estimator /3 = (f3x1\a |> IasT ) satisfies 



A6A 



1-k' 

1 + k2 



rE 



lUeI inf <!i±4E/3A + ^E(^A-/?A)^ + E^UI+^^E^A 



11^2 



icr 1 - k2 



X^m 



X£m 



Xdm 



Aer 



^ inf ^l±42:/3f + ^2:E((/3A-/3A)^)+5:E(r?!)ULZ)5:F,, 



icr 1 - K^ 



X^m 



Xdm 



Xdrn 



Aer 



with 
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for all A = {j, k) in F, 



E((/3a - PxY) = Var(/3A) <. K {- + tt; + 



1 1 2-^n 



n T r2 

where iC is a positive constant depending on ||/i||i, ||/i||oo) llV'lli ^-^d llV'lb (see lLemma 2.ip : 
for all A = (j, A;) in F, 



,.On>/.oV(^)..„B(^).Ai^ 



and 



and 



X _ Jo2^°^' I io I Vj^. 



where X depends on e, k, 7, 



• Li? = ^((1 + ^-1/2)^1/2 + (l + 5,i/2)£i/2^i/2) ^Kfl(e-'^'joT/2 + exp(-g(eo)n||/i|| 1/2)), where 
i^ is a positive constant depending only on e and k, 

• > ifA = |r|, where |r| is the cardinal of the set F. So, we can upper bound this quantity by 
Aer 

K 2-^0, where iT is a positive constant depending only on the compact support of h and the 
compact support of ip. 

Recall that e = 1, k g]0; 1[ will be fixed in the sequel and 7 > 0, according Assumption (Al). 

It remains to compute E(??^). Let A G F. We have: 



with K depending on e, k, 7, 



^ II r 111 J 



and 



and E(A^I) = n\\h\\i + n^\\h\\{ ^ 2n^Ml ^ 



We control V{'p\) in expectation and we recall that a = K^jo'y. 



nVi^x)) ^ E(y(v9A)) + \/2aE{V{ipx))E{B\ipx)) + 3aE{B\ipx)), 
with using inequality (j6.4p . 



E{Viipx))=E{V{ipx))^K \n\\h 



n 



loo -r — ll"-!!! 



where iiT is a positive constant depending only on HV'Ib- 
Now, we focus on K{B'^{^px)) where 



B{ipx) = B{ipx) = 

n 

E 



E 



77/ — 1 

^a(- - Ui) E^(<^a(- - U)) 



n 



sup 



i=l 



ifxit - Ui) E^i^pxit - U)) 



n 



with B{ipx) = sup 



^ B{lpx) + -\\ipx\\i, 



Y,W\{t-Ui)-E^{ipx{t-U))] 



(6.36) 



Then, we have to control E{B{lpx))- Since it is a decomposition biorthogonal wavelet, ipx is a piecewise 
constant function and we can write: 



N 



v>\ 



^'^iM'^iM^ 
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where A^ E N* and for any I € {1, . . . , N}, ai, bi,ci £ M and ai < hi. It is easy to see that 



N 



N 



B{^x) ^ j;S(qlK;6,]) = Y. \^l\BiMarM)- 



1=1 



1=1 



It remains to compute E(S(lu;w)) for some interval [a;b]. 



-B(l[a;b]) = sup 



^ sup 



Y,[Ma;b]ii-Ui)-E^{lla;b]it-U))] 
1 
n 



j=l 



where for any t G M, Bt = [t — b;t — a]. 

We set B = {Bt,t G M} and for every integer n, m.n{B) = sup |{^ Pi Bt.,t G M}|. It is easy to see 

AcIR,|A|=n 

that 

mn{B) ^ 1 + ^ ^ ^ 

and so, the VC-dimension of B defined by sup{n ^ 0, nin{B) = 2"} is bounded by 2 (see Definition 6.2 

of [20|). 

By applying Lemma 6.4 of [2Qj , we obtain: 

where ET is an absolute constant. So, for any A in T, 



E(S(v.a)) ^ ^. 



n 



But, we want an upper bound of ¥,{B'^{(px)). For this, we use Theorem 11 of [T]: 

[E{B\^,))Y^' ^ K |e(S(^a)) + IIA^Ib}, 



where 



Hence, 



M= uia3isviY)\ip\{t-Ui)-¥..„{ipx{t-U))\. 



ICiCn 



with K a constant only depending on 
Finally, 



l^^\^x\\'L^K2i, 



2-^ 



lKiB\^x))^K {E{B'i^x)) + j^ 



^K ^[E(B(v.a))]' + 2^- + |J 



1 2^-J 



(6.37) 



with K a constant only depending on HV'lli and 
Then combining (|6.36p and ()6.37p yields 



E(^^) ^ A' ^ + 



Jo , Jo 



3/22,0/2 ,22.0 .42.,o ,, .2 -3/22,0/2 



n n 



3/2 



Jo^ Jo^ Jo Jo , VJ^ , ion 
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where i^ is a constant depending on 7, ||/i||i, ||/i||oo) ll^llii llV'lb and ||V'||oo) which concludes the proof 
of lTheorem 31 by setting 

i^(jo,n,T) - - + — ^^ + -^ + - + -^^ + ^. 



